  • Advertisements will inevitably become a regular feature in AI-powered chatbots. Smaller startups are already monetizing their chatbots through advertising plugins. The future of ad-driven chatbots will depend on how developers and marketers implement ads.

    Tuesday, March 19, 2024
  • 81% of consumers are willing to wait between 1 and 11 minutes to speak with a human customer service rep instead of chatting with an AI assistant, with 22% of respondents willing to wait 4-5 minutes. Humans are still perceived as better suited for complex problem-solving, one-call issue resolution, and empathetic interactions.

  • Meta's new AI Studio lets users create, share, and discover AI chatbots without technical skills, including personalized chatbots that offer customized interactions with followers and full control over auto-replies. The feature is rolling out to Instagram Business accounts in the US and will soon be available to all Meta users through the web, Instagram, Messenger, and WhatsApp.

  • Recent research has highlighted a concerning trend in the performance of larger artificial intelligence (AI) chatbots: as these models grow in size and complexity, they become increasingly prone to generating incorrect answers. This is particularly troubling because users often fail to recognize when the information provided is inaccurate. The study, conducted by José Hernández-Orallo and his team at the Valencian Research Institute for Artificial Intelligence, examined three prominent AI models: OpenAI's GPT, Meta's LLaMA, and the open-source BLOOM model. The researchers analyzed how the accuracy of these models changed as they were refined and expanded with more training data and greater computational resources.

    They discovered that while larger models generally produced more accurate responses, they also showed a greater tendency to answer questions incorrectly rather than admit a lack of knowledge. Because the refined models are less inclined to say "I don't know" or to avoid answering altogether, users are likely to encounter more incorrect answers. The fraction of incorrect responses rose significantly among the refined models, with some providing wrong answers over 60% of the time when they should have either declined to answer or responded correctly. This trend raises concerns about reliability: the chatbots often present themselves as knowledgeable even when they are not, a behavior philosopher Mike Hicks describes as "bullshitting." Such behavior can mislead users into overestimating the capabilities of these AI systems, which poses risks whenever users rely on them for accurate information.

    To assess the models' performance, the researchers tested them on a wide range of prompts, including arithmetic, geography, and science questions, while also accounting for the perceived difficulty of each question. They found that although accuracy improved with larger models, the tendency to provide incorrect answers did not decrease proportionately, particularly on more challenging questions. This inconsistency suggests there is no guaranteed "safe zone" in which users can trust the answers these chatbots provide.

    Moreover, the study revealed that human users struggle to identify incorrect answers, misclassifying them as correct between 10% and 40% of the time, regardless of the question's difficulty. Hernández-Orallo emphasized the need for developers to improve AI performance on easier questions and to encourage models to refrain from answering difficult ones, thereby helping users better understand when they can rely on AI for accurate information. While some AI models are designed to acknowledge their limitations and decline to answer when uncertain, this feature is not universally implemented, particularly in all-purpose chatbots. As companies strive to create more capable and versatile AI systems, the challenge remains to balance performance with reliability, ensuring that users can navigate AI-generated information without falling prey to misinformation.